Apparatus for video communication

ABSTRACT

An apparatus for video communication connected to a local processing device executing a video conferencing software for communicating with a remote location, the apparatus being configured to be locally wirelessly connected to a plurality of mobile devices each comprising a display and a video camera, to receive a plurality of video signals related to a plurality of participants in a video conference generated by the video cameras of the plurality of local mobile devices, to generate a video stream based on said plurality of video signals received from the plurality of local mobile devices, to transmit the video stream to said local processing device, to receive from the local processing device an output signal representing a content displayed by a display of the processing device when executing the video conferencing software and to share with the plurality of local mobile devices the output signal.

TECHNICAL FIELD OF THE INVENTION

The present invention relates to an apparatus for video communication, particularly for video conferences.

BACKGROUND OF THE INVENTION

So-called “endpoint” or “room” or “group” apparatuses dedicated to videoconferences with multiple participants present simultaneously in meeting rooms and the like are known.

Generally, the known apparatuses are provided with a camera, one or more screens, microphones and loudspeakers and are installed by specialized operators inside the rooms they must serve.

However, such known apparatuses are generally very costly and have little flexibility, since they are not easily and rapidly adaptable to the different needs of users.

A known alternative to this type of apparatus is the use of systems of the “desktop” type, made up of a PC, a monitor, loudspeakers, a webcam, a microphone and, naturally, software for video conferences.

This type of system is very economical and quite effective if a user wants to take part in a video conference, for example, from their desk, but does not effectively meet the needs of a video conference in a room with multiple participants.

In fact, in the case of use in a meeting room, the PC is generally connected to a television set placed at one end of the meeting room, and on which the webcam is mounted. The microphone is commonly connected to the PC and placed on top of a table in the meeting room.

Due to the limited framing angle of a webcam, which in general is 90° at most, it is easy to understand that the aforesaid solution largely undermines the effectiveness of the video conference, since the majority of people taking part in it are either not clearly visible due to the excessive distance or are even not captured by the webcam and, as a consequence, it is possible only to hear their voice.

In order to remedy this limitation, sometimes the conference participants move closer to one another, getting into uncomfortable positions, and above all considerably reducing the effectiveness of communication in the video conference session.

The patent application US 2013/0307919 A1 refers to video conferencing equipment. This document describes a multiple camera (multi-camera) device having a plurality of input ports, each configured to receive video information when connected to a respective external video camera, and an output port capable of outputting video information. The multi-camera device also includes a selection component configured to provide video information from at least one of the plurality of input ports to the output port for output to an external computer. The selection component may produce composite video information from video information received from multiple input ports. Any video conferencing software (e.g., Skype, WebEx or AdobeConnect) may be installed on the external computer. Audio/video information from multiple cameras, either as video from a single camera that changes as the speaker changes or as composite video information displaying multiple participants, may be transmitted to a remote location via one or more networks to which the external computer is connected.

Despite representing an improvement with respect to other known techniques, the system described in US 2013/0307919 A1 is still not satisfactory, because the multiple participants in the conference in a room cannot always properly follow the interventions of the participants in the remote location, due to the fact that the participants in the remote location are only displayed on the screen of the external device running the video conferencing software. In particular, when the screen of the external device running the video conferencing software is small and there is a large number of participants attending the conference, following the interventions of the participants in the remote location is difficult and inconvenient. The technique of US 2013/0307919 A1 is applicable to the case of an impromptu video conference, i.e. a video conference carried out in the absence of installed dedicated video conference equipment (such as a large TV set or screen). In such cases of impromptu video conference, the solution of the prior art is particularly unsatisfactory, because it is difficult to properly follow the interventions of the participants at the remote location, as mentioned above.

As a further drawback of the prior art, the gaze of a local participant is normally directed towards the screen of the laptop for watching the remote participants, and not towards the respective video camera capturing the image of the local participant; as a consequence, the remote participants do not have the impression that the local participants are looking towards them during the conference; in other words, no eye-to-eye contact can be established between the local participants and the remote participants. This renders the conference even less effective.

SUMMARY OF THE INVENTION

The main task of the present invention is to devise an apparatus for video communication that serves to improve the effectiveness of communication in a video conference session by effectively managing the framing of the participants present in the room.

Another object of the present invention is to devise an apparatus for video communication which enables the various participants to remain comfortably seated in their places inside the room throughout the entire video conference session.

A further object of the present invention is to devise an apparatus for video communication which makes it possible to overcome the above-mentioned limitations of the prior art within the scope of a simple, rational solution that is easy and effective to use and has a modest cost.

A further object of the present invention is to provide an apparatus and a system for video communication which permit the local participants to properly follow the interventions of the participants at the remote location, even in the case of an impromptu video conference.

A further object of the present invention is to provide an apparatus and a system for video communication which permit eye-to-eye contact to be established between the local participants and the remote participants during the video conference.

The objects set forth above are achieved by an apparatus for video communication configured to be locally connected to a local processing device executing a video conferencing software for communicating with a remote location, the apparatus being further configured to be locally wirelessly connected to a plurality of local mobile devices each comprising a display and a video camera, the apparatus comprising:

-   receiving means configured to receive a plurality of video signals related to a plurality of participants in a video conference generated by the video cameras of the plurality of local mobile devices,
-   generating means operatively connected to said receiving means and configured to generate an output video communication stream based on said plurality of video signals received from the plurality of local mobile devices, and
-   transmitting means configured to transmit the output video communication stream to said local processing device executing the video conferencing software,

wherein the apparatus is configured to receive from the local processing device an output signal representing a content displayable by a display of the processing device when executing the video conferencing software, wherein the apparatus comprises sharing means configured to share with the plurality of local mobile devices the output signal representing the content displayable by the display of the processing device.

In this aspect, the apparatus shares (i.e., transmits) the content displayable by a display of the processing device executing the video conferencing software (i.e. a content including or constituting the video images received at the processing device from the remote location by means of the video conferencing software) to all mobile devices involved in the conference. This function can be achieved by mirroring the display of the processing device executing the video conferencing software at the displays of all mobile devices involved in the conference. Thus, all participants in the conference in the room can properly follow the interventions of the participants in the remote location by means of their respective mobile devices, even in the case of an impromptu video conference and in the absence of a large TV set or screen in the room. Furthermore, eye-to-eye contact between the local participants and the remote participants can be achieved during the video conference, because the participants look at their own mobile devices during the conference and the mobile devices capture the images of the participants.

According to a further aspect, the apparatus is wirelessly connected with the local mobile devices via an IEEE 802.11 interface for receiving the video signals generated by the video cameras of the plurality of local mobile devices and for sharing the output signal representing the content displayable by the display of the processing device.

According to a further aspect, the apparatus is wirelessly connected with the local processing device executing the video conferencing software via an IEEE 802.11 interface for transmitting the output video communication signal to the local processing device and for receiving the output signal representing the content displayable by the display of the processing device.

According to a further aspect, the apparatus is further configured to receive from the local processing device a video signal generated by a video camera associated with the local processing device, the generating means being configured to generate said output video communication stream also based on the video signal received from the local processing device. In this aspect, the available resources are used efficiently, by involving also the local processing device executing the conferencing software in the generation of the single output video communication stream.

According to a further aspect, the generating means are configured to combine said plurality of video signals received from said plurality of local mobile devices into a single video communication stream representing a video including a patchwork of moving images captured by different local mobile devices.

The present invention further provides a system for video communication comprising: an apparatus as mentioned above, a local processing device executing a video conferencing software for communicating with a remote location, and a plurality of local mobile devices each comprising a display and a video camera,

the apparatus being locally connected to the local processing device executing the video conferencing software, and being locally wirelessly connected to the plurality of local mobile devices.

According to a further aspect, each of the local mobile devices is any one of a notebook, a laptop, a mobile phone and a tablet. In this aspect, all participants in the room can properly follow the conference by using their own mobile devices.

According to a further aspect, the local processing device executing a video conferencing software is any one of a notebook, a laptop, a personal computer, a mobile phone and a tablet.

According to a further aspect, the local processing device, when executing the video conferencing software, is configured to exchange video conferencing signals including the output video communication stream with the remote location via the internet, wherein the local processing device, when executing the video conferencing software, is configured to access the internet via the apparatus. According to this aspect, when carrying out the video conference, the local processing device can use a single wireless connection (such as a WiFi connection) to the apparatus both for communicating with the remote location and for exchanging signals locally with the mobile devices of the participants in the room. This allows devices capable of establishing only one wireless connection to be used as the host device for running the video conferencing software.

According to a further aspect, the invention provides an apparatus for video communication, able to produce at output a video communication stream, comprising processing means and operatively associable with image acquisition means able to produce at least a video signal related to a plurality of participants in a video conference, comprising:

recognition means to recognize at least a speaker among said participants in the video conference, starting from said at least one video signal, and

generating means of said video communication stream operatively connected to said processing means and to said recognition means and able to generate said video communication stream starting from said at least one video signal and depending on the signal at output from said recognition means.

According to a further aspect, the apparatus is configured for receiving a framing command from one of the mobile devices or from the processing device, the framing command being inputted by a user of the mobile device or of the processing device, wherein the generating means are configured to change a mode of generating the output video communication stream depending on a recognition of a speaker based on the received framing command. This allows the speaking person to be reliably identified during the video conference.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Other features and advantages of the present invention will become more apparent from the description of a preferred but not exclusive embodiment of an apparatus for video communication, illustrated by way of a representative, non-limiting example in the appended drawings, in which:

FIG. 1 is a schematic view of the apparatus according to the invention applied to a generic video conference system;

FIG. 2 is a schematic representation of the operation of the apparatus according to the invention;

FIG. 3 shows an illustrative implementation of apparatuses used according to the invention.

With particular reference to the figures, 1 denotes in its entirety an apparatus for video communication.

Within the framework of the present description, reference will be made to an “apparatus for video communication”, particularly for video conferences; this should not be construed as limiting, since the apparatus 1 can also be used in other forms of video communication.

The apparatus 1 has been conceived to be placed on a surface 2, such as, for example, a table in a meeting room. Alternatively, the apparatus 1 might also be installed in a disguised manner in a room.

As can be seen from the figures, according to a preferred embodiment the apparatus 1 has a substantially cylindrical outer body 3. However, different shapes and sizes of the body 3 are not ruled out.

The apparatus 1 can be connected to a personal computer 4 of a conventional type, such as a desktop computer, a laptop computer or any other computer of a known type, provided with video conferencing software of a conventional type (such as Skype or the like).

Advantageously, the apparatus 1 is able to produce as output a video communication stream 6 suitable for being sent to the personal computer 4 and, subsequently, by the video conferencing software on the personal computer itself, for example over the conventional Internet network, towards a remote screen 5 used by one or more remote participants taking part in the video conference session.

Optionally, the personal computer 4 is further connected to a screen 5′ present inside the room, such as a television set or a monitor, which can be used to display the images received from remote participants.

Alternatively, the apparatus 1 might be connected to the screen 5′, for example via HDMI cable, so that the apparatus 1 may provide the video conferencing signal received from the remote location to the screen 5′ for display (not shown in the figures); in this manner, the personal computer 4 might not need any cable connections for the purpose of the video conference, if connected to the apparatus 1 via a wireless interface.

According to the invention, the apparatus 1 comprises processing means 7 and is operatively connectable to image acquisition means 11 adapted to produce at least a video signal 9 related to the participants in the video conference.

Advantageously, the image acquisition means 11 connectable to the apparatus 1 can be multiple and of different types.

For example, the image acquisition means 11 can comprise:

-   at least a camera 11 such as a webcam or the like;
-   at least a camera 11 of at least one mobile device 10, such as, for example, a smartphone or a tablet;
-   at least a camera 11 of at least one portable personal computer, such as a laptop or the like.

Therefore, the apparatus 1 supports the simultaneous use of a plurality of cameras 11, also of different types and built into different devices.

One or more webcams 11 can for example be associated with the body 3 of the apparatus 1 or, alternatively, they can be rested upon or in some way fixed to the table in the room in which the video conference is being held.

If the cameras 11 of one or more mobile devices 10 are used, each mobile device 10 is equipped with management and control means 14, preferably consisting of specific software for managing and sending the images picked up by the mobile device itself.

Usefully, the management and control means 14 can be provided with functions for controlling the sending of the captured image.

For example, the control functions can enable the captured images to be managed, for example functions of momentarily stopping the transmission of the images and/or functions for selecting the cameras to be used among those built into the mobile device 10 (for example, the front and/or back camera).

Preferably, the image acquisition means 11 comprises a plurality of cameras 11 configured to capture images of the different participants in the video conference. However, it is not ruled out that images of the different participants can be captured with a single camera 11.

Advantageously, according to the invention, the apparatus 1 comprises recognition means 13 to recognize at least one speaker among all the participants in the video conference.

In detail, the recognition means 13 comprises at least a suitable software module for processing the video images of the participants in the video conference in order to identify the speaker.

In particular, the recognition means 13 can comprise at least a software module for detecting the lip movements made by the speaker.
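Purely by way of illustration, one possible way of deriving a speaker from lip movements is sketched below. The description does not specify any algorithm; the sketch assumes that an upstream face tracker (hypothetical, not shown) supplies for each participant a short history of mouth-opening ratios, and it selects the participant whose ratio varies the most. The function name `detect_speaker` and the threshold `min_variance` are assumptions introduced only for this example.

```python
from statistics import pvariance


def detect_speaker(mouth_openness, min_variance=0.01):
    """Return the id of the participant whose mouth moves the most, or None.

    mouth_openness maps a participant id to a list of recent per-frame
    mouth-opening ratios (hypothetical input from an upstream face tracker).
    min_variance is an assumed threshold below which nobody is considered
    to be speaking.
    """
    best_id, best_var = None, min_variance
    for participant, samples in mouth_openness.items():
        if len(samples) < 2:
            continue  # not enough history to estimate movement
        var = pvariance(samples)
        if var > best_var:
            best_id, best_var = participant, var
    return best_id
```

Returning `None` when no participant exceeds the threshold leaves it to the caller to decide how to proceed when the speaker cannot be determined.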

Different implementations of the recognition means 13 are not ruled out, however.

For example, the recognition means 13 can comprise at least a software module for detecting a gesture of the participants in front of the camera. This gesture can be, for example, the raising of a hand by one of the participants. In this manner, therefore, during the video conference it will always be possible for each of the participants to have themselves framed (i.e. become the speakers) by making a certain gesture in front of the camera.

Alternatively, the apparatus 1 is configured for receiving a framing command from one of the mobile devices 10 or from the processing device 4, whereby the framing command is inputted to the mobile device or to the processing device by its user by means of a software interface installed on the respective device. In this case, the generating means 17 can be configured to change a mode of generating the output video communication stream depending on a recognition of a speaker based on the received framing command. Thus, the recognition means 13 may recognize the speaker based on the framing command; this allows the speaking person to be reliably identified during the video conference. Preferably, only one among the mobile devices 10 and the processing device 4 is entitled to send the framing command to the apparatus 1; in this manner, a single local user might properly control the generation of the output video communication stream to be transmitted to the remote location.
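The handling of a framing command from a single entitled device can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation of the apparatus 1: the class name `FramingController`, the device identifiers and the command format are all hypothetical.

```python
class FramingController:
    """Accept framing commands only from one entitled device (assumed policy)."""

    def __init__(self, authorized_device):
        # Identifier of the only device entitled to send framing commands.
        self.authorized_device = authorized_device
        # Participant currently designated as speaker; None until a command arrives.
        self.speaker = None

    def handle_command(self, device_id, participant):
        """Set the framed participant if the sender is entitled; report success."""
        if device_id != self.authorized_device:
            return False  # commands from other devices are ignored
        self.speaker = participant
        return True
```

For instance, a controller created with `FramingController("pc-4")` would ignore a command arriving from `"phone-1"` and leave the current speaker unchanged.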

The recognition means 13 are operatively connected to processing means 7.

Advantageously, the apparatus 1 further comprises generating means 17 for generating the video communication stream 6, operatively connected to processing means 7 and recognition means 13 and adapted to generate the video communication stream 6 starting from the video signals 9 acquired by means of the cameras 11 and depending on the signal output from said recognition means 13, i.e. depending on the speaker detected.

In particular, the generating means 17 can comprise a mixing module adapted to compose the video streams originating from the image acquisition means 11.

In other words, the mixing module is adapted to perform a so-called “mixing”, i.e. to integrate the plurality of the video signals 9 originating from the individual cameras 11 into a single video stream.

In detail, the integration, i.e. the composition of the video streams into a single video stream, takes place depending on the criteria and specific needs of a user.

In particular, the apparatus 1 captures the video images from all the devices connected to it, crops the images around the face of the participant and is capable of identifying which one of them is actually speaking.
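The cropping step can be illustrated with simple rectangle arithmetic. The sketch below assumes that a face bounding box `(x, y, w, h)` in pixels is already available from some face detector (hypothetical, not shown); the function name `crop_around_face` and the `margin` parameter are assumptions made for this example only.

```python
def crop_around_face(face, frame_w, frame_h, margin=0.5):
    """Return a crop rectangle (left, top, width, height) around a face box.

    face is (x, y, w, h) in pixels. The box is enlarged on every side by
    margin times its own size and then clamped to the frame boundaries.
    """
    x, y, w, h = face
    pad_x, pad_y = int(w * margin), int(h * margin)
    left = max(0, x - pad_x)
    top = max(0, y - pad_y)
    right = min(frame_w, x + w + pad_x)
    bottom = min(frame_h, y + h + pad_y)
    return left, top, right - left, bottom - top
```

Clamping keeps the crop inside the captured frame when a participant sits close to the edge of the camera's field of view.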

Once it has collected the video images, the apparatus 1, by means of the mixing module, is capable of composing the video communication stream 6 toward the personal computer 4. Depending on the specific layout selected by the user, the video communication stream 6 can comprise:

-   displaying exclusively the video signal 9 relating only to the speaking participant identified;
-   the mixing of the video signals 9 of all participants with the speaker highlighted (the composition of the video image can be achieved in different ways);
-   the mixing of the video signals 9 relating to all participants with equal emphasis (all the video images are conveyed in the same video communication stream without any difference between speakers and non-speakers).

The selection might occur also during the conference by means of a software interface installed on the personal computer 4.

For example, with reference to mixing with the speaker highlighted, the video image of the speaker can take on a specific positioning inside the screen 5 and, moreover, has larger dimensions than the video images relating to the other participants in the video conference.

This makes it possible not to exclude the other participants in the video conference, but rather to modulate the dimensions and positioning of the respective videos within the screen 5 (and optionally within the preview on the screen 5′ inside the room) depending on how the video conference proceeds.

In other words, when there is a switch from one speaker to another, the video images of the person who has just started speaking take on a larger size and occupy the main position inside the screen 5 compared to those relating to the previous speaker, which take on a secondary position.

However, different configurations with different formats can be adopted as regards both the resolution and disposition of each video image inside the screen 5.
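One of the many possible dispositions for mixing with the speaker highlighted is sketched below. The geometry (the speaker occupying the left three quarters of the frame and the other participants stacked in a strip on the right) is an assumption chosen for the example and is not prescribed by the description; the function name `highlighted_layout` is likewise hypothetical.

```python
def highlighted_layout(participants, speaker, width, height):
    """Map each participant to a tile (left, top, width, height) in the frame.

    Assumed geometry: the speaker fills the left 3/4 of the frame, the other
    participants share a vertical strip on the right in equal slices.
    """
    others = [p for p in participants if p != speaker]
    if not others:
        return {speaker: (0, 0, width, height)}
    main_w = (width * 3) // 4
    tiles = {speaker: (0, 0, main_w, height)}
    tile_h = height // len(others)
    for i, p in enumerate(others):
        tiles[p] = (main_w, i * tile_h, width - main_w, tile_h)
    return tiles
```

When the speaker changes, calling the function again with the new speaker moves that participant to the main tile and returns the previous speaker to the side strip.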

It should be noted, moreover, that from the point of view of implementation, the selection of the particular mixing method can be made by the user of the personal computer 4 which manages the video conference, by appropriately selecting a specific video driver of the apparatus 1 from among the following specific types of layout:

-   driver 1: speaker only;
-   driver 2: mixing with speaker highlighted;
-   driver 3: mixing with equal emphasis on all participants.

Usefully, if for any reason the apparatus 1 is not capable of determining with certainty which of the participants is the speaker, the apparatus 1 will automatically select the mixing mode with equal emphasis on all participants.
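The three layouts and the automatic fallback can be summarised in a few lines. The sketch below is illustrative only; the names `Layout` and `effective_layout` are assumptions, and an undetermined speaker is represented here by `None`.

```python
from enum import Enum


class Layout(Enum):
    SPEAKER_ONLY = 1         # corresponds to "driver 1"
    SPEAKER_HIGHLIGHTED = 2  # corresponds to "driver 2"
    EQUAL_EMPHASIS = 3       # corresponds to "driver 3"


def effective_layout(requested, speaker):
    """Return the layout to use, given the user's choice and the detected speaker.

    If no speaker could be determined (speaker is None), fall back to equal
    emphasis on all participants regardless of the requested layout.
    """
    if speaker is None:
        return Layout.EQUAL_EMPHASIS
    return requested
```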

Advantageously, the processing means 7 comprises one or more processing units, and are adapted to produce as output the video communication stream 6.

It is worth pointing out that, within the scope of the present discussion, “video communication stream” means the signal which preferably comprises both a video signal 9 and an audio signal 12.

Preferably, the video communication stream 6 is processed by the generating means 17 inside the apparatus 1 and can be transmitted via a physical output such as a USB port, an antenna, or the like to the personal computer 4 and, therefore, to the management software of the video conference. However, different embodiments in which the generating means 17 are implemented in a device separate from the apparatus 1, for example on a personal computer 4, are not ruled out.

The management system of the apparatus 1 comprises management and control means 14 operatively connected to processing means 7 and suitable for the management and control of the video signal 9 acquired by camera 11 of each mobile device 10.

Advantageously, the management and control means 14 comprises at least a software program installable on each mobile device 10. In particular, the software program is adapted to manage the communication with the processing means 7 of the apparatus 1.

Advantageously, the apparatus 1 comprises sharing means 18 for sharing the desktop of the personal computer 4 connected in the video conference. In particular, the sharing means 18 is operatively connected to the management and control means 14 of each mobile device 10, for the sharing, on the mobile devices, of an output signal 15 originating from the personal computer 4 and relating to the desktop of the personal computer 4 adapted to manage the video conference.

The use of the sharing means 18 is to be considered particularly advantageous in the event that in the meeting room there is no screen 5′, such as, for example, a TV or a large monitor, on which all the participants can see the remote participants and the shared documents.

In particular, according to a possible embodiment, the apparatus 1 comprises a video input (for example, of the HDMI, DVI or VGA type) and the personal computer 4, on which the program used for the video conference (for example Skype) is being run, is connected via a video output (HDMI/DVI/VGA) to said video input. Therefore, the output signal 15 originating directly from the video output of the personal computer 4 is transmitted to the video input of the apparatus 1 and, from there, is transmitted via the sharing means 18 to each of the mobile devices 10.

In practical terms, therefore, users participating in the video conference with a mobile device 10 will send (if they want) their image and audio captured by the mobile device 10 to the apparatus 1 and can receive and display on the mobile device the desktop of the personal computer 4 and, therefore, the video image of the remotely connected participants, as well as any documents shared in the video conference session.

With reference to a possible alternative embodiment, the personal computer 4 on which the program used for the video conference is being run (for example Skype) is connected via USB, Ethernet or Wi-Fi to the apparatus 1. In such a case, on the personal computer 4 there is an acquisition module 19 operatively connected to the sharing means 18 on the apparatus 1 and adapted to perform a screen-capture of the desktop of the personal computer 4. For example, the acquisition module 19 can consist of specific client software installable in the personal computer 4.

The video images thus captured are then encapsulated in a stream format that is sent from the personal computer 4 to the apparatus 1, and from the latter to all the connected mobile devices 10.

In practical terms, users who participate in the video conference with a mobile device 10 can send their image and the audio captured by the mobile device 10 to the apparatus 1 and can receive and view on the mobile device itself the image of the desktop of the connected personal computer 4, and thus the video of the remotely connected participants and any documents shared in the video conference session.
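The distribution of the captured desktop to all connected mobile devices can be sketched as a simple fan-out. The example below is a minimal in-memory illustration under stated assumptions: the class name `SharingHub`, the device identifiers and the bounded per-device queue are all hypothetical, and the actual network transport and stream format are outside its scope.

```python
from collections import deque


class SharingHub:
    """Fan out each captured desktop frame to every registered mobile device."""

    def __init__(self, max_pending=2):
        # Assumed policy: keep only the newest frames per device so that a
        # slow device sees recent content instead of a growing backlog.
        self.max_pending = max_pending
        self.queues = {}

    def register(self, device_id):
        """Start sharing with a newly connected mobile device."""
        self.queues[device_id] = deque(maxlen=self.max_pending)

    def share(self, frame):
        """Queue the frame for every device; return how many devices received it."""
        for pending in self.queues.values():
            pending.append(frame)
        return len(self.queues)
```

A bounded queue is used here because, for screen mirroring, displaying the most recent frame generally matters more than delivering every frame.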

Finally, the apparatus 1 comprises sound acquisition means 16 for acquiring the voice signal of the speaker.

Advantageously, the sound acquisition means 16 can comprise:

-   at least a microphone 16, which may be positioned, for example, on the table in the meeting room;
-   at least a microphone 16 connected to one of the mobile devices 10;
-   at least a microphone 16 built into the mobile devices 10;
-   at least a microphone 16 built into a webcam 11;
-   at least a microphone 16 built into the apparatus 1.

In particular, the apparatus 1 enables the use of the microphone 16 built into a mobile device 10. This makes it possible to avoid the use of one or more microphones specifically dedicated to the video conference, as well as enabling an optimal sound acquisition for each of the participants.

Usefully, moreover, the apparatus 1 can have at least one USB port for the connection and recharging of the mobile devices 10 used by the users sitting at the table.

Finally, the configuration of the apparatus 1 (possible indication of the local network connected to, settings, activation of sources, limitations imposed, preferences and other parameters) can be carried out by means of a specific software application installable in a personal computer 4 or a mobile device 10.

In conclusion, it is specified that the video communication system, meant as the combination of hardware and software capable of completely managing a video conference, comprises the above-described apparatus 1, the above-described image acquisition means 11, operatively connected to the apparatus 1 and adapted to produce at least a video signal 9 related to a plurality of participants in a video conference, and the personal computer 4 operatively connected to the apparatus 1 and provided with video conferencing software.

The operation of the present invention is as follows.

During a video conference, the recognition means 13 processes the video images acquired from the cameras 11 and detects the lip movements (or other possible predefined movements and gestures) of the speaker.

The speaker's voice is picked up by a microphone 16 built into a portable device 10 or, alternatively, by a special dedicated microphone 16 placed so as to rest on a surface 2 in proximity to the speaker.

The output audio signal 12 and video signal 9 are directed to the generating means 17, which in turn integrate them so as to generate the video communication stream 6 output from the apparatus 1.

In particular, the video communication stream 6 is generated according to the specific mode of mixing the video signals 9 selected by the user. The video communication stream 6 is then directed to the screen 5 of one or more remote participants which, for example, can display the speaker in the foreground and the other participants in the video conference in the background.

Optionally, via the sharing means 18, operatively connected to management and control means 14 of each mobile device 10, it is possible to share, on the mobile devices, one or more video signals 15 originating from the video conferencing software on the personal computer 4 and relating to one or more remote participants in the video conference.

FIG. 3 shows an illustrative implementation of a computer system 300 that may be used to implement one or more of the above-described devices, for example to implement the above-described apparatus 1. The computer system 300 may include one or more processors 310 and one or more non-transitory computer-readable storage media (e.g., memory 320 and/or one or more non-volatile storage media 330). The processor 310 may control writing data to and reading data from the memory 320 and/or the non-volatile storage device 330 in any suitable known manner. Processor 310, for example, may form the processing means 7, the generating means 17 and the recognition means 13 provided as part of the apparatus 1; the processor 310 may perform the functionality described above for the processing means 7, the generating means 17 and the recognition means 13. To perform the above-described functionality of the apparatus 1, the processor 310 may execute instructions stored in one or more computer-readable storage media (e.g., the memory 320, storage media, etc.), which may serve as non-transitory computer-readable storage media storing instructions for execution by processor 310. The computer system 300 includes an input/output functionality 340 to receive data and to provide data, and may include a control apparatus to perform I/O functionality. In particular, the computer system 300, when implementing the apparatus 1, includes one or more antennas for receiving/transmitting video signals from/to the mobile devices 10 and the local processing device 4. The function of the above-described sharing means 18 may be performed by the I/O 340 under the control of the processor 310.

According to an embodiment, the generating means 17 and the recognition means 13 may be part of the processing means 7 of the apparatus 1. According to an embodiment, the function of the generating means 17 may be performed by the processing means 7 of the apparatus 1 and the generating means 17 may be absent. According to an embodiment, the function of the recognition means 13 may be performed by the processing means 7 of the apparatus 1 and the recognition means 13 may be absent. According to an embodiment, the above-described personal computer 4 can be replaced by a mobile phone, a tablet or a laptop executing the video conferencing software (such as Skype, etc.).

It has been ascertained in practice that the described invention achieves the set objectives.

In particular, it should be emphasized that the apparatus for video communication according to the invention serves to improve the effectiveness of communication in a video conference session through the real-time identification of each individual speaker and an effective management of the framing of the participants present in the room.

Furthermore, the apparatus according to the invention enables the use of the cameras and microphones of mobile devices, thus ensuring that the video images and sounds of the participants can be effectively picked up without any additional cost.

This also permits the various participants to remain comfortably seated in their places inside the room throughout the entire video conference session.

1. Apparatus (1) for video communication configured to be locally connected to a local processing device (4) executing a video conferencing software for communicating with a remote location (5), the apparatus (1) being further configured to be locally wirelessly connected to a plurality of local mobile devices (10) each comprising a display and a video camera (11), the apparatus (1) comprising: receiving means configured to receive a plurality of video signals (9) related to a plurality of participants in a video conference generated by the video cameras (11) of the plurality of local mobile devices (10), generating means (17) operatively connected to said receiving means and configured to generate an output video communication stream (6) based on said plurality of video signals (9) received from the plurality of local mobile devices, and transmitting means configured to transmit the output video communication stream (6) to said local processing device (4) executing the video conferencing software, characterized in that the apparatus (1) is configured to receive from the local processing device (4) an output signal (15) representing a content displayable by a display of the processing device (4) when executing the video conferencing software, wherein the apparatus (1) comprises sharing means (18) configured to share with the plurality of local mobile devices (10) the output signal (15) representing the content displayable by the display of the processing device (4).
2. Apparatus (1) according to claim 1, wherein the apparatus (1) is wirelessly connected with the local mobile devices (10) via an IEEE 802.11 interface for receiving the video signals generated by the video cameras (11) of the plurality of local mobile devices (10) and for sharing the output signal (15) representing the content displayable by the display of the processing device (4).
3. Apparatus (1) according to claim 1, wherein the apparatus (1) is wirelessly connected with the local processing device (4) executing the video conferencing software via an IEEE 802.11 interface for transmitting the output video communication stream to the local processing device (4) and for receiving the output signal (15) representing the content displayable by the display of the processing device (4).
4. Apparatus (1) according to claim 1, wherein the apparatus is further configured to receive from the local processing device (4) a video signal generated by a video camera associated to the local processing device (4), the generating means (17) being configured to generate said output video communication stream (6) also based on the video signal received from the local processing device (4).
5. Apparatus (1) according to claim 1, characterized in that said generating means (17) are configured to combine said plurality of video signals (9) received from said plurality of local mobile devices (10) into a single video communication stream (6) representing a video including a patchwork of moving images captured by different local mobile devices (10).
6. Apparatus (1) according to claim 1, being configured for receiving a framing command from one of the mobile devices (10) or from the processing device (4), the framing command being inputted by a user of the mobile device (10) or of the processing device (4), wherein the generating means (17) are configured to change a mode of generating the output video communication stream (6) depending on a recognition of a speaker based on the received framing command.
7. System for video communication comprising: an apparatus (1) according to claim 1, a local processing device (4) executing a video conferencing software for communicating with a remote location, and a plurality of local mobile devices (10) each comprising a display and a video camera, the apparatus (1) being locally connected to the local processing device (4) executing the video conferencing software, and being locally wirelessly connected to the plurality of local mobile devices (10).
8. System for video communication according to claim 7, wherein each of the local mobile devices (10) is any one among a notebook, a laptop, a mobile phone and a tablet.
9. System for video communication according to claim 7, wherein the local processing device (4) executing a video conferencing software is any one among a personal computer, a notebook, a laptop, a mobile phone and a tablet.
10. System for video communication according to claim 7, wherein the local processing device (4), when executing the video conferencing software, is configured to exchange video conferencing signals including the output video communication stream with the remote location (5) via internet, wherein the local processing device (4), when executing the video conferencing software, is configured to access the internet via the apparatus (1).
11. Apparatus (1) for video communication, able to produce at output a video communication stream (6), comprising processing means (7) and operatively associable with image acquisition means (11) able to produce at least a video signal (9) related to a plurality of participants in a video conference, characterized in that it comprises: recognition means (13) to recognize at least a speaker among said participants in the video conference, starting from said at least one video signal (9), and generating means (17) of said video communication stream (6) operatively connected to said processing means (7) and to said recognition means (13) and able to generate said video communication stream (6) starting from said at least one video signal (9) and depending on the signal at output from said recognition means (13).
12. Apparatus (1) according to claim 11, characterized in that said recognition means (13) comprise at least a software processing module of said at least one video signal (9) acquired for the identification of said speaker.
13. Apparatus (1) according to claim 11, characterized in that said recognition means (13) comprise at least a software processing module of said at least one video signal (9) for the detection of at least one of: the lip movements of at least one of the participants in the video conference, at least a predefined gesture from at least one of the participants in the video conference.
14. Apparatus (1) according to claim 11, characterized in that said generating means (17) comprise at least a mixing module for the composition of said at least one video signal (9) coming from said acquisition means (11) into a single video communication stream (6).
15. Apparatus (1) according to claim 14, characterized in that said video communication stream (6) composed by said mixing module comprises: exclusively the video signal (9) relating to the only speaking participant identified; the mixing of the video signals (9) of all participants with the speaker highlighted; the mixing of the video signals (9) relating to all participants with equal emphasis.
16. Apparatus (1) according to claim 11, characterized in that said image acquisition means (11) comprise at least a video camera (11).
17. Apparatus (1) according to claim 11, characterized in that said image acquisition means (11) comprise at least one of: at least a webcam (11); at least a video camera (11) associable with at least a mobile device (10); at least a video camera (11) built in at least a mobile device (10).
18. Apparatus (1) according to claim 11, characterized in that it comprises management and control means (14) operatively connected to said processing means (7) and suitable for the management and control of the video signal (9) acquired by said video camera (11) of the mobile device (10).
19. Apparatus (1) according to claim 18, characterized in that said management and control means (14) comprise at least a software program installed on said mobile device (10).
20. Apparatus (1) according to claim 11, characterized in that it comprises sharing means (18) operatively connected to said management and control means (14) for the sharing on said at least one mobile device (10) of at least an output signal (15) relating to the desktop of a personal computer (4) operatively connected to said apparatus (1) and having a video conferencing software.
21.-23. (canceled)